Name | Version | Summary | date |
journ4list |
0.11.0 |
A powerful async news content extraction library with modern API for web scraping and article analysis |
2025-07-24 21:57:44 |
palimpzest |
0.7.20 |
Palimpzest is a system which enables anyone to process AI-powered analytical queries simply by defining them in a declarative language |
2025-07-23 19:20:06 |
nanonets-extractor |
0.1.4 |
A unified document extraction library supporting local CPU, GPU, and cloud processing |
2025-07-23 11:17:54 |
pdfalchemy |
0.1.0 |
A Python library for advanced PDF manipulation and processing |
2025-07-19 13:36:21 |
pdfix-sdk |
8.7.2 |
PDFix SDK - Automated PDF Remediation, Data Extraction, HTML Conversion |
2025-07-18 06:51:27 |
docx-footer-extractor |
1.0.0 |
A Python library for extracting metadata from DOCX file footers using parallel processing |
2025-07-17 11:30:20 |
motionminer |
1.0.2 |
Extract videos from Google Motion Photos with ease! |
2025-07-15 13:37:12 |
arc-file-extractor |
0.1.0 |
CLI frontend for unified file extraction on UNIX systems. |
2025-07-15 12:22:20 |
tabuparse |
0.1.0 |
A Python CLI tool for extracting, normalizing, and merging tabular data from PDF documents |
2025-07-11 16:50:57 |
ai-finance-agent |
0.1.2 |
An agentic AI library for processing financial receipts |
2025-07-10 11:29:35 |
atai-image-tool |
0.0.2 |
Extract text from images using OCR and save to JSON or print to console |
2025-02-27 05:14:23 |
pyvisionai |
0.3.1 |
A Python library for extracting and describing content from documents using Vision LLMs |
2025-02-22 22:21:47 |
scrapfly-sdk |
0.8.21 |
Scrapfly SDK for Scrapfly |
2025-01-29 14:33:40 |
pdftext |
0.5.1 |
Extract structured text from pdfs quickly |
2025-01-28 17:10:43 |
klayout-pex |
0.1.13 |
Parasitic Extraction Tool for KLayout |
2025-01-27 18:00:07 |
gimie |
0.7.2 |
Extract structured metadata from git repositories. |
2024-12-18 09:05:46 |
tifex-py |
0.1.1 |
TODO |
2024-12-08 11:55:52 |
efel |
5.7.13 |
Electrophys Feature Extract Library (eFEL) |
2024-11-26 10:12:45 |
validex |
0.0.2 |
A Python package to extract data from unstructured into structured format |
2024-11-20 20:20:58 |
llama-index-packs-amazon-product-extraction |
0.3.0 |
llama-index packs amazon_product_extraction integration |
2024-11-18 02:01:55 |